GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame__plus_p2n model 



Run on: April 21, 2004, 07:44:27 ; Search time 395.202 Seconds 

(without alignments) 
2698.108 Million cell updates/sec 

Title: US-08-737-319{l) frans/ccM ~* Stcu^C^ J f\ 7>tiA ol dtCt IoOUS 

Perfect score: 1341 ^ 

Sequence: 1 MSMPNIVPPAEVRTEGLSLE KGEVDAKSLEDLSDNKVWKM 251 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 3373863 seqs, 2124099041 residues 

Total number of hits satisfying chosen parameters: 6747726 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL-f rame+_p2n. model -DEV-xlh 

Q=/cgn2_l/USPTO_spool/US08737319/runat_15042004_131640_26696/app_query. f asta_l . 8 
46 

-DB=N_Geneseq_2 9Jan04 -QFMT=fastap -SUFFIX=p2n . rng -MINMATCH— 0 . 1 -LOOPCL=0 
-LOOPEXT=0 -UNITS-bits -START-1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN-200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM-ext -HEAPSI ZE=500 -MINLEN-0 -MAXLEN=2000000000 
-USER=US087 37319_@CGN_l_l_58 6_@runat_1504 2 004_13164 0_2 6696 -NCPU-6 -ICPU-3 
-NO_MMAP -LARGEQUERY -NEG__SCORES=0 -WAIT -DSPBLOCK=l 00 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_T±MEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT-7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP-6 -DELEXT-7 



Database 



N_Geneseq_2 9Jan04 : * 



9 

10 



genes eqnl 980s : * 
geneseqnl990s : * 
geneseqn2000s : * 
geneseqn2001as : * 
geneseqn2001bs : * 
geneseqn2002s : * 
geneseqn2003as : * 
geneseqn2003bs : * 
geneseqn2003cs : * 
geneseqn2004s : + 



0-' 



ft/ Ay 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 0^ 
and is derived by analysis of the total score distribution. Arfl*' 

SUMMARIES H U 

Result Query QdXU « 

No. Score Match Length DB ID De1s*cription 



dt 1 
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1341 


100 , 


. 0 


1100 


2 


AAT4 0216 Alirf £J0 




t a n 9 1 


Q r~\ /~ri ionics 
OfcJLJLitrllCe 
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1341 


100 , 


. 0 
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2 


AAT72945 CJO 


A ;=i "h 7 9 Q 4 S 


Ph A f f i a r* 
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3 


819 


61 . 


, 1 


8 4 6 


9 
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AHhfiQ fifiO 

r\\J.kJ U JUOU 


U . I lfc; UIUI 
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51 . 


. 8 
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9 


ADB69299 




AHhfi Q9 Q Q 


^ . IlcUlUI 
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. 8 


3019 
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tt.Hi-J UO ^JO 


C T"» f a y 

• IlcUlUI 


6 


640.5 


47 . 


. 8 


816 


7 


ABT19254 




Aht 1 99 S4 


tt.OLJtiXLjX._LX 


7 


629 


46 . 


, 9 


855 


6 


ABZ32187 




Ah^^91 R7 

r\.U i, JZ, 1 O / 


L^clllLi-LLlcl d 


8 


627 . 5 


46 . 


. 8 


816 


7 


ABT21074 




Aht 9 1 (174 


A o r^v o "K~ rr i 1 1 
nojJt, L y ill 


9 


623 


46 . 


. 5 


8400 


6 


AAD31029 




A^rHI 09Q 




10 


623 


46 , 


. 5 


14 623 


6 


AAD31039 




rt-d U J 1 U J J 


rldo LIU L 


11 


623 


46. 


. 5 


14623 


6 
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pi^otiH t 

r ± a o I L i_l L. 


12 


621 


46, 


, 3 
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6 
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rlClLlJ _L v J J. J7 


uLlllZ.UodL 


13 


579.5 


43 , 


, 2 


1807 


6 


ABK84546 




AhVR 4 S4 6 




14 


579.5 


43 t 


, 2 


1807 


6 


ABN97217 




Ahn Q7 9 1 7 

tt.U ID / Z, X / 


belle ff O I X 


15 


579.5 


43 . 


, 2 


1807 


7 


ACA8 9903 




.tt.l_.ciO _7_7VJO 


oeiie uiii 


16 


579.5 


43 . 


. 2 


1920 


6 


ABQ93307 




tt.XJ U ^ J JU / 


Hninan r -1 PT\T 
n LiH Let 11 


17 


578 . 5 


43, 


, 1 


867 


6 


ABK96803 




Abk968 0 3 


Q (^(^ V-(=lTT "I 
l_> • \Z; J_ G V J_ 


18 


578 . 5 


43 . 


, 1 


1058 


2 


AAT40218 




Aat 4 091 ft 




19 


571.5 


42 . 


. 6 


822 


2 


AAX3 98 8 6 




nd A O _? O O U 


uao i — L 1L (_ 


20 


560 . 5 


41 . 


. 8 


1182 


7 


ABT42305 




Aht 4 9 "30 S 

rllJ L4Z, JU J 


Tovi r^i tw 

1 LJ2iXL-r-L Ly 


21 


560 . 5 


41 . 


, 8 


1182 


9 


ADB56191 




AHhS61 91 


T A V 1 C* ~\ t \ 7 

i uaili t_ y 


22 


560 . 5 


41 . 


, 8 


1182 


9 


ADB53451 




Adb534 51 


i- J_ _L ILld. _L y J_ 


23 


552 . 5 


41 . 


, 2 


2957 


7 


ABT18066 




Aht 1 8066 


tt.o jJcIUlll 


24 


552 . 5 


41 . 


, 2 


2958 


7 


ABT19880 




Aht 1 Qftftfl 
tt_U Li^OOU 


A r7« y-N /^f 1 1 "I 

.Hoper cjiii 


25 


552 


4 \ 1 


2 


1 971 


-5 


AAZ45588 




i-ia z f± o o o o 


Nucleot id 


26 


552 


4 1 . 


, 2 


1271 


6 


ABS71576 




Ahc;7 1 S7 

ttXJ o / X ^> / D 


7\ x~\ a 1 aoe 
tt.. ]JclXclt;c> 


27 


547 


4 0 . 


, 8 


i 009 




AAZ45587 






Nucleot id 


28 


547 


40. 


,8 


1002 


6 


ABS71575 




Abs71575 


A. palaes 


29 


546.5 


40. 


,8 


958 


7 


ABT18660 




Abtl8660 


Aspergill 


30 


546.5 


40. 


.8 


958 


7 


ABT20476 




Abt20476 


Aspergill 


31 


541 


40. 


,3 


1031 


3 


AAZ45594 




Aaz45594 


Nucleotid 


32 


541 


40. 


3 


1031 


6 


ABS71582 




Abs71582 


0. sativa 


33 


534.5 


39. 


9 


996 


3 


AAZ45581 




Aaz45581 


cDNA enco 


34 


534.5 


39. 


9 


996 


6 


ABS71569 




Abs71569 


A. thalia 


35 


532 


39. 


7 


990 


3 


AAC47123 




Aac47123 


Arabidops 


36 


530.5 


39. 


6 


855 


6 


ABZ12767 




Abzl2767 


Arabidops 


37 


530.5 


39. 


6 


880 


3 


AAZ98352 




Aaz98352 


A. thalia 


38 


529.5 


39. 


5 


988 


3 


AAZ45591 




Aaz45591 


Nucleotid 


39 


529.5 


39. 


5 


988 


6 


ABS71579 




Abs71579 


L. sativa 


40 


528.5 


39. 


4 


876 


6 


ABZ12487 




Abzl2487 


Arabidops 


41 


527 


39. 


3 


954 


2 


AAT95373 




Aat95373 


Arabidops 


42 


527 


39. 


3 


954 


3 


AAZ45580 




Aaz45580 


cDNA enco 


43 


527 


39. 


3 


954 


6 


ABS71568 




Abs71568 


A. thalia 


44 


525 


39. 


1 


698 


3 


AAF14664 




Aafl4664 


Aspergill 


45 


520.5 


38. 


8 


985 


3 


AAZ45590 




Aaz45590 


Nucleotid 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 21, 2004, 07:44:27 



Search time 3990.9 Seconds 

(without alignments) 

2725.977 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-08-737-319-1 
1341 

1 MSMPNIVPPAEVRTEGLSLE 



KGEVDAKSLEDLSDNKVWKM 251 



Scoring table: 



BLOSUM62 



Xgapop 10.0 , 

Ygapop 10.0 , 

Fgapop 6.0 , 

Delop 6.0 , 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched : 



3470272 seqs, 



21671516995 residues 



Total number of hits satisfying chosen parameters: 



6940544 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-MODEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US08737319/runat_15042004_131641_26703/app_query. fasta_l . 8 
46 

-DB=GenEmbl -QFMT=fastap -SUFTIX=p2n . rge -MINMATCH-0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START-1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT=pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER=US08737319_@CGN_l_l_3077_@runat_15042004_131641_26703 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT==120 -WARN_TIMEOUT=30 -THREADS=1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT=7 

Database : GenEmbl : * 



1: 
2: 
3: 
4 : 
5: 
6: 
7: 
8: 
9: 
10 



gb_ba: * 
gb_htg: * 
gb_in : * 
gb_om : * 
gb_ov : * 
gb_pat : * 
gb_ph : * 
gb__pl : * 
gb_pr : * 
gb_ro : * 



11 


gb_ 


sts : * 


12 


gb 


sy : * 


13 


gb 


un : * 


14 


gb_ 


vi : * 


15 


em 


~ba: * 


16 


em 


fun : * 


17 


em 


hum: * 


18 


em 


_in: * 


19 


em 


mu: * 


20 


em 


om: * 


21 


em 


_or : * 


22 


em 


ov; * 


23 


em 


pat : * 


24 


em 


ph: * 


25 


em 


_pl:* 


26 


em 


ro : * 


27 


em 


_sts : * 


28 


em 


un : * 


29 


em 


vi : * 


30 


em 


htg hum: * 


31 


em 


htg inv:* 


32 


em 


htg other:* 


33 


em 


htg mus : * 


34 


em 


htg pin: * 


35 


em 


htg rod:* 


36 


em 


htg mam: * 


37 


em 


htg vrt : * 


38 


em 


sy : * 


39 


em 


htgo hum:* 


40 


em 


htgo mus : * 


41 


em 


htgo other:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


1341 


100 


0 


1084 


8 


AB019035 /W ~ 


/AB019035jxanthophy 


2 


1341 


100 


0 


1099 


6 


E12045 Tf miMft%/ 


E12045 cDNA encodi 


3 


1341 


100 


0 


1099 


6 


BD014674 3f Lt»i c^^^ 


BD014674 Method of 


4 


1341 


100 


0 


1165 


6 


A638 95 Uo <Wfm}3i/ 


A63895 Sequence 20 


5 


1341 


100 


0 


1165 


6 


AR366158 6*3 tJilHL QM t ai) 


2&R366158 Sequence 


6 


1167 


87 


0 


3239 


8 


PRY15811 tWI 


Y15811 Xanthophyll 


7 


695 


51* 


8 


93593 


8 


AF263283 


AF263283 Filobasid 


8 


695 


51 


8 


93979 


8 


AC068564 


AC068564 Filobasid 


9 


637 


47 


5 


1017 


8 


AF479816 


AF479816 Aspergill 


10 


629 


46 


9 


855 


6 


AX489174 


AX489174 Sequence 


11 


621 


46 


3 


931 


8 


SPU21154 


U21154 Schizosacch 


12 


579.5 


43 


2 


687 


9 


BT006761 


BT006761 Homo sapi 


13 


579. 5 


43 


2 


687 


12 


BT007937 


BT007937 Synthetic 


14 


579. 5 


43 


2 


1288 


9 


BC019227 


BC019227 Homo sapi 


15 


579. 5 


43 


2 


1773 


9 


AF2 71720 


AF271720 Homo sapi 


16 


579. 5 


43 


2 


1807 


6 


AX411068 


AX411068 Sequence 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: April 21, 2004, 07:44:29 / Search time 511.316 Seconds 

(without alignments) 
9690.649 Million cell updates/sec 



Title: US-08-737-31 9-4 

Perfect score: 1099 

Sequence: 1 CCCACGCGTCCGCACATCTC AC T AC AAAAAAAAAAAAAAA 1099 

Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 

Searched: 2907579 seqs, 2254313464 residues 

Total number of ^ hits satisfying chosen parameters: 5815158 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_NA: * 

1 : /cgn2_6/ptodata/l/pubpna/US07_PUBCOMB.seq: * 
2 : /cgn2_6/ptodata/l/pubpna/PCT_NEW_PUB.seq: * 
3 : /cgn2_6/ptodata/l/pubpna/US06_NEW_PUB. seq: * 
4 : /cgn2_6/ptodata/l/pubpna/US06_PUBCOMB.seq: * 
5 : /cgn2_6/ptodata/l/pubpna/US07_NEW__PUB.seq: * 
6: /cgn2_6/ptodata/l/pubpna/PCTUS_PUBCOMB. seq: * 
7 : /cgn2_6/ptodata/l/pubpna/US08_NEW_PUB.seq: * 
8 : /cgn2_6/ptodata/l/pubpna/US08__PUBCOMB. seq: * 
9: /cgn2_6/ptodata/l/pubpna/US09A_PUBCOMB.seq: * 
10: /cgn2_6/ptodata/l/pubpna/US09B_PUBCOMB. seq:* 
11 : /cgn2_6/ptodata/l/pubpna/US09C_PUBCOMB. seq: * 
12 : /cgn2_6/ptodata/l/pubpna/US09_NEW_PUB.seq: * 
13: /cgn2_6/ptodata/l/pubpna/US0 9_NEW__PUB. seq2: * 
14 : /cgn2_6/ptodata/l/pubpna/US10A_PUBCOMB. seq: * 
15 : /cgn2_6/ptodata/l/pubpna/US10B_PUBCOMB. seq: * 
16: /cgn2_6/ptodata/l/pubpna/US10C_PUBCOMB. seq: * 
17 : /cgn2_6/ptodata/l/pubpna/US10__NEW_PUB. seq: * 
18 : /cgn2_6/ptodata/l/pubpna/US60_NEW_PUB. seq: * 
19: /cgn2_6/ptodata/l/pubpna/US60_PUBCOMB. seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

o 
o 

Result Query 



No. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


202. 6 


1 Q 


A 


846 


16 


US-10-320-797-2065 


Sequence 2065, Ap 


2 


182.2 


1 O . 


U 


816 


15 


US-10-128-714-2424 


Sequence 24 24, Ap 


3 


179 


I O . 




816 


15 


US-10-128-714-7424 


Sequence 7424, Ap 


4 


126.2 


1 1 
1 1 . 


c; 


684 


10 


US-09-918-740-23 


Sequence 23, Appl 


5 


126.2 


1 I . 


c: 
O 


684 


10 


US-09-918-740-54 


Sequence 54, Appl 


6 


126.2 


1 1 


(Z. 

3 


8400 


10 


US-09-918-740-64 


Sequence 64, Appl 


7 


126.2 


I 1 

I I . 


D 


14623 


10 


US-09-918-740-74 


Sequence 74, Appl 


8 


126.2 


1 1 
X X . 


c 

D 


14623 


10 


US-09-918-740-76 


Sequence 7 6, Appl 


9 


125 


1 1 
X X . 


A 
H 


1002 


9 


US-09-323-998D-28 


Sequence 28, Appl 


10 


124.2 


1 1 
X X . 




1271 


9 


US-09-323-998D-29 


Sequence 29, Appl 


11 


121. 4 


1 1 
X X . 


U 


1031 


9 


US-09-323-998D-35 


Sequence 35, Appl 


12 


114.8 


X u . 


/I 


985 


9 


US-09-323-998D-31 


Sequence 31, Appl 


13 


112. 8 


x u . 




1182 


12 


US-10-152-319A-2007 


Sequence 2007, Ap 


14 


112.8 


i n 
x u . 


~3 


1182 


16 


US-10-388-934-12 


Sequence 12, Appl 


15 


111. 6 


XU . 


o 


1807 


9 


US-09-880-107-3712 


Sequence 3712, Ap 


16 


111. 6 


x u . 


o 

, Z 


1920 


13 


US-10-363-616-20 


Sequence 20, Appl 


17 


111. 6 


i n 

X u . 


o 

, z 


2059 


10 


US-09-814-353-20314 


Sequence 20314, A 


18 


111 


i n 
x u . 


, X 


876 


9 


US-09-938-842A-292 


Sequence 292, App 


19 


111 


X u . 


1 

. X 


876 


11 


US-09-938-842A-292 


Sequence 2 92, App 


20 


109.8 


x u , 


. u 


954 


9 


US-09-323-998D-34 


Sequence 34, Appl 


21 


109. 4 


1 n 
x u , 




855 


9 


US-09-938-842A-572 


Sequence 572, App 


22 


109.4 


X u . 


. u 


855 


11 


US-09-938-842A-572 


Sequence 57 2, App 


23 


109. 4 


XU . 


. U 


880 


15 


US-10-342-224-95 


Sequence 95, Appl 


24 


109. 4 


1 n 

X u , 


. u 


954 


9 


US-09-323-998D-9 


Sequence 9, Appli 


25 


109.4 


X U , 


. u 


996 


9 


US-09-323-998D-10 


Sequence 10, Appl 


26 


107.2 


Q 

y . 


Q 

. o 


831 


13 


US-10-424-599-13178 


Sequence 13178, A 


27 


102. 6 


y . 




3203 


9 


US-09-925-302-142 


Sequence 142, App 


28 


102. 6 


y 


-D 

. j 


3203 


13 


US-09-925-302-142 


Sequence 142, App 


29 


101.8 


y 


o 

. o 


867 


16 


US-10-369-493-46169 


Sequence 46169, A 


30 


99.2 


Q 

y 


A 

. u 


453 


10 


US-09-918-995-3039 


Sequence 3039, Ap 


31 


98.8 


Q 


. U 


988 


9 


US-09-323-998D-32 


Sequence 32, Appl 


32 


89.6 


Q 

o 


. z 


1019 


16 


US-10-320-797-1065 


Sequence 1065, Ap 


33 


89.6 


8 


.2 


3019 


16 


US-10-320-797-65 


Sequence 65, Appl 


34 


89.4 


8 


.1 


958 


15 


US-10-128-714-1424 


Sequence 1424, Ap 


35 


89.4 


8 


. 1 


958 


15 


US-10-128-714-6424 


Sequence 6424, Ap 


36 


89.4 


8 


. 1 


2957 


15 


US-10-128-714-424 


Sequence 4 24, App 


37 


89.4 


8 


. 1 


2958 


15 


US-10-128-714-5424 


Sequence 5424, Ap 


38 


86.4 


7 


. 9 


1332 


16 


US-10-120-988-353 


Sequence 353, App 


39 


86.4 


7 


.9 


1359 


16 


US-10-094-749-1226 


Sequence 1226, Ap 


40 


86.4 


7 


. 9 


1378 


15 


US-10-103-313-24 


Sequence 24, Appl 


41 


86.4 


7 


. 9 


1453 


15 


US-10-103-313-152 


Sequence 152, App 


42 


77.4 


7 


.0 


775 


16 


US-10-369-493-27183 


Sequence 27183, A 


43 


74.6 


6 


.8 


827 


16 


US-10-369-493-36552 


Sequence 36552, A 


44 


73 


6 


. 6 


1874 


9 


US-09-323-998D-33 


Sequence 33, Appl 


45 


72.6 


6 


. 6 


788 


13 


US-10-425-114-1629 


Sequence 1629, Ap 



ALIGNMENTS 



RESULT 1 

US-10-320-797-2065 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: 



April 21, 2004, 07:44:27 ; Search time 4118.1 Seconds 

(without alignments) 
2725.977 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-08-737-319/2 L 
1398 

1 MQLLAEDRTDHMRGASTWAG 




AALNTDKHEDWGTVHHINEA 25 9 



Scoring table: 



BLOSUM62 



Xgapop 10.0 
Ygapop 10.0 
Fgapop 6 . 0 
Delop 6.0 



Xgapext 
Ygapext 
Fgapext 
Delext 



0.5 
0.5 
7.0 
7.0 



Searched: 



3470272 seqs 



21671516995 residues 



Total number of hits satisfying chosen parameters: 



6940544 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2_l/USPTO_spool/US08737319/runat_15042004_131641_26703/app_query. fasta_1.8 
46 

-DB=GenEmbl -QFMT=fastap -SUFFIX=p2n . rge -MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 
-UNITS=bits -START-1 -END=-1 -MATRIX=blosum62 -TRANS-human4 0 . cdi -LIST=45 
-DOCALIGN=200 -THR_SCORE-pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 -MODE=LOCAL 
-OUTFMT-pto -NORM=ext -HEAPSIZE=500 -MINLEN=0 -MAXLEN=2000000000 
-USER-US08737319_@CGN_l_l_3077_@runat_15042004_131641_2 6703 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG__SCORES=0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEVJTIMEOUT=120 -WARN_TIMEOUT=30 -THREADS-1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP-10 -YGAPEXT-0.5 -DELOP=6 -DELEXT=7 

Database : GenEmbl : * 



1: 
2: 
3: 
4 : 
5: 
6: 
7 : 
8: 
9: 
10 



gb_ba : * 
gb_htg : * 
gb_in : * 
gb_om : * 
gb_ov : * 
gb_pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 




11: 


gb sts:* 


12: 


gb sy:* 


13: 


gb un : * 


14 : 


gb vi : * 


15: 


em ba : * 


16: 


em fun:* 


17: 


em hum : * 


18: 


em in:* 


19: 


em mu : * 


20: 


em om : * 


21: 


em or : * 


22: 


em ov : * 


23: 


em pat : * 


24 : 


em ph : * 


25: 


em pi : * 


26: 


em ro : * 


27: 


em sts:* 


28: 


em un : * 


29: 


em vi : * 


30: 


em htg hum:* 


31: 


em htg inv:* 


32: 


em htg other: 


33: 


em htg mus : * 


34: 


em htg pin : * 


35: 


em htg rod: * 


36: 


em htg mam: * 


37: 


em htg vrt : * 


38: 


em sy:* 


39: 


em htgo hum:* 


40: 


em htgo mus : * 


41: 


em htgo other 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result 
No. 



SUMMARIES 



Score 



Query 

Match Length DB ID 



De/scription 



1 


1398 


100. 


0 


1058 


2 


1398 


100. 


0 


1074 


3 


1398 


100. 


0 


1074 


4 


1394 


99. 


7 


1099 


5 


1394 


99. 


7 


1109 


6 


1394 


99 


7 


1109 


7 


1383 


98 


9 


1135 


8 


1383 


98 


9 


1135 


9 


1383 


98 


9 


1135 


10 


1368 


97 


9 


1150 


11 


1368 


97 


9 


1165 


12 


1368 


97 


9 


1165 


13 


1368 


97 


9 


1165 


14 


777.5 


55 


6 


1860 


15 


777.5 


55 


6 


1874 


16 


777.5 


55 


6 


1874 



o (AB019034 J 
6 E12046 JP rtV.Z<42 S <*t 

6 BD014675 J f Z^o| 

8 AF082326 - /ft ? 

6 pAR428559 - <+LH2A%( 

6r*BD223743 Jf ^oot 

6 -AR003639 S>V«/5YI 

6 JpAR428553 

6 ,^D223737 

8 AF082325-I^* 

6 -AR003638 

6 /UAR428552 




AB019034 Haematoco 
e/2046 cDNA encodi 
)014675 Method of 
VF082326 Haematoco 
'AR428559 Sequence 
-&B2-M7 4J Series oi~~ 



-BD223736 
AF082869 
AR428562 
BD223746 



(£kR003639 Sequence 
AR428553 Sequence 
BD223737 Genes of 
AF082325 Haematoco 
AR003638 Sequence 
AR428552 Sequence 
BD223736 Genes of 
AF0828 69 Chlamydom 
AR428562 Sequence 
BD223746 Genes of 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - nucleic search, using f rame_plus_p2n model 



Run on: April 21, 2004, 07:44:27 ; Search time 407.798 Seconds 

(without alignments) 
2698.108 Million cell updates/sec 

Title: US-08-737-319-2 
Perfect score: 1398 

Sequence: 1 MQLLAEDRTDHMRGASTWAG AALNTDKHEDWGTVHHINEA 259 



Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: . 3373863 seqs, 2124099041 residues 

Total number of hits satisfying chosen parameters: 6747726 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Command line parameters: 
-M0DEL=frame+_p2n. model -DEV=xlh 

Q=/cgn2__l/USPTO_spool/US08737319/runat_15042004_131640_26696/app_query. fasta_l .8 
46 

-DB-N_Geneseq_2 9Jan04 -QFMT-fastap -SUFFIX=p2n . rng -MINMATCH-0 . 1 -LOOPCL-0 
-LOOPEXT=0 -UNITS-bits -START=1 -END=-1 -MATRIX=blosum62 -TRANS=human4 0 . cdi 
-LIST=45 -DOCALIGN=200 -THR_SCORE=pct -THR_MAX=100 -THR_MIN=0 -ALIGN=15 
-MODE=LOCAL -OUTFMT=pto -NORM=ext -HEAPSIZE-500 -MINLEN=0 -MAXLEN=2 000000000 
-USER=US08737 319_@CGN_l_l_586_@runat_15042004_13164 0_2 6696 -NCPU=6 -ICPU=3 
-NO_MMAP -LARGEQUERY -NEG_SCORES-0 -WAIT -DSPBLOCK=100 -LONGLOG 

-DEV_TIMEOUT=120 -WARN_TIMEOUT=30 -THREADS-1 -XGAPOP=10 -XGAPEXT=0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0 . 5 -DELOP=6 -DELEXT-7 



Database : N_Geneseq__2 9Jan04 : * 

1: geneseqnl 980s : * 

2: geneseqnl 990s : * 

3: geneseqn2000s : * 

4: geneseqn2001as : * 

5: geneseqn2001bs : * 

6: geneseqn2002s : * 

7: geneseqn2003as : * 

8: geneseqn2003bs : * 

9: geneseqn2003cs : * 

10 : geneseqn2004s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



Result 
No. 



SUMMARIES 



Score 



Query 

Match Length DB 



ID 



1 


1398 


100 . 


o 


2 


1394 


99 . 


7 


3 


1394 


99 . 


7 


4 


1383 


98 . 


9 


5 


1383 


98 . 


9 




1383 


98 . 


9 


7 


1368 


97 . 


9 


8 


1368 


97 


Q 

ZJ 


9 


1368 


97 . 


9 


10 


777.5 


S S 

hJ -J . 


\j 


11 


777.5 


55 . 


5 


12 


538 


38 . 


5 


13 


527 


37 . 


7 


14 


527 


37 . 


7 


15 


527 


37 . 


7 


16 


527 


37 . 


7 


17 


526 


37 . 


6 


18 


526 


37 . 


6 


19 


526 


37 . 




20 


523 


37 


4 


21 


522 


.j / * 




00 

z z 


s? 0 


17 




Z J 




17 




24 


-J z z 


17 




25 


522 


17 




0 

z u 


J z z 


17 


-J 


01 
z / 


O z 1 


17 




28 


521 


37 . 


3 


29 


521 


37. 


3 


30 


517 


37. 


0 


31 


517 


37 . 


0 


32 


517 


37. 


0 


33 


517 


37. 


0 


34 


516 


36. 


9 


35 


516 


36. 


9 


36 


515 


36. 


8 


37 


515 


36. 


8 


38 


513 


36. 


7 


39 


510 


36. 


5 


40 


510 


36. 


5 


41 


505 


36. 


1 


42 


501 


35. 


8 


43 


497.5 


35. 


6 


44 


496 


35. 


5 


45 


496 


35. 


5 



1134 2 AAT40217 £J0 qtt/tSSys 
1109 3 pkAZ45589 1<i/t3&SS 
1109 6r-ABS7157 7US/V^ ZOOZfOtUH 

1135 j£--ftAT95391 OoWflW&V^ 
1135 3 AAZ4558360 f W*X ^ 
1135 6f-rABS71571 

1165 *2--AAT95390 

1165 3 *AAZ45582 

1165 6L-ABS71570 

1874 3 AAZ45592 

1874 6 ABS71580 

816 7 ABT19254 

988 3 AAZ45591 

988 6 ABS71579 

1031 3 7AAZ45594 

1031 6 ABS71582 

816 7 ABT21074 

876 6 ABZ12487 

990 3 AAC47123 

1020 3 AAA07585 

684 6 AAD31019 

996 3 AAZ45581 

996 6 ABS71569 

8400 6 AAD31029 

14623 6 AAD31039 

14623 6 AAD31041 

954 2 AAT95373 

954 3 AAZ45580 

954 6 ABS71568 

1002 3 AAZ45587 

1002 6 ABS71575 

1288 4 AAC62234 

1288 5 AAH44118 

985 3 AAZ45590 

985 6 ABS71578 

855 6 ABZ12767 

880 3 AAZ98352 

855 6 ABZ32187 

1271 3 AAZ45588 

1271 6 ABS71576 

1101 3 AAC46636 

850 4 ABL10211 

822 2 AAX39886 

867 6 ABK96803 

1058 2 AAT40218 




fl7 
fz45589 
Abs71577 



Sequence 
Nucleotid 
H. pluvia^ 



Aat95391 
Aaz45583 
Abs71571 
Aat95390 
Aaz45582 
Abs71570 
Aaz45592 
Abs71580 



Haeraatoco 
cDNA enco 
H. pluvia 
Haematoco 
cDNA enco 
H. pluvia 
Nucleotid 
C. reinha 



Abtl9254 Aspergill 
Aaz45591 Nucleotid 



Abs71579 
Aaz45594 
Abs71582 
Abt21074 



L. sativa 
Nucleotid 
O. sativa 
Aspergill 



Abzl2487 Arabidops 
Aac47123 Arabidops 
Aaa07585 Marigold 
Aad31019 Schizosac 
Aaz45581 cDNA enco 
Abs71569 A. thalia 
Aad31029 Operon G 
Aad31039 Plastid t 
Aad31041 Plastid t 
Aat95373 Arabidops 
Aaz45580 cDNA enco 
Abs71568 A. thalia 
Aaz45587 Nucleotid 
Abs71575 A. palaes 



Aac62234 
Aah44118 
Aaz45590 
Abs71578 



DNA encod 
Hevea bra 
Nucleotid 
L. sativa 



Abzl2767 Arabidops 
Aaz98352 A. thalia 



Abz32187 
Aaz45588 
Abs71576 
Aac46636 



Candida a 
Nucleotid 
A. palaes 
Zea mays 



Abll0211 Drosophil 
Aax39886 Gastric c 
Abk96803 S. cerevi 
Aat40218 Sequence 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic 
Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



nucleic search, using sw model 

April 21, 2004, 07:44:29 ; Search time 499.684 Seconds 

(without alignments) 
9690.649 Million cell updates/sec 

US-08-737-319-5 
1074 

1 ATCGCTACTTGGAACCTGGC CTGGAAAAAAAAAAAAAAAA 107 4 

I DENT I T Y_NUC 
Gapop 10.0 , Gapext 1.0 



Searched: 2907579 seqs, 2254313464 residues 

Total number of hits satisfying chosen parameters; 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



5815158 



Post -processing : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database : 



Published_Applications_NA: * 

/cgn2_6/ptodata/l/pubpna/US07_PUBCOMB. se 
/cgn2_6/ptodata/l/pubpna/PCT_NEW_PUB. seq 
/cgn2_6/ptodata/l/pubpna/US06_NEW_PUB. se 
/cgn2_6/ptodata/l/pubpna/US06_PUBCOMB. se 
/cgn2_6/ptodata/l/pubpna/US07_NEW_PUB . se 
/cgn2_6/ptodata/l/pubpna/PCTUS_PUBCOMB.s 
/cgn2_6/ptodata/l/pubpna/US08_NEW_PUB.se 
/cgn2_6/ptodata/l/pubpna/US08_PUBCOMB. se 
/cgn2_6/ptodata/l/pubpna/US09A_PUBCOMB.s 
/cgn2_6/ptodata/l/pubpna/US09B_PUBCOMB. 
/cgn2_6/ptodata/l/pubpna/US09C_PUBCOMB. 
/cgn2_6/ptodata/l/pubpna/US09_NEW_PUB.s 
/cgn2_6/ptodata/l/pubpna/US09_NEW_PUB. s 
/cgn2__6/ptodata/l/pubpna/US10A_PUBCOMB 
/cgn2_6/ptodata/l/pubpna/US10B_PUBCOMB. 
/cgn2_6/ptodata/l/pubpna/US10C_PUBCOMB. 
/cgn2_6/ptodata/l/pubpna/US10_NEW_PUB. s 
/cgn2_6/ptodata/l/pubpna/US60_NEW_PUB. s 
/cgn2_6/ptodata/l/pubpna/US60_PUBCOMB. s 



10 
11 
12 
13 
14 
15 
16 
17 
18 
19 



q:* 

q:* 
q:* 
q:* 
eq: * 
q:* 
q:* 
eq : * 
seq: • 
seq: ■ 
eq: * 
eq2 : 
seq: ' 
seq: ■ 
seq: : 
eq: * 
eq: * 
eq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 



lo. 


Score 


Match 


Length 


DB 


ID 


Description 


1 


931 


86 . 


7 


1109 


9 


US-09-323-998D-30 


Sequence 30, Appl 


2 


925. 6 


86 . 


2 


1135 


9 


US-09-323-998D-12 


Sequence 12, Appl 


3 


878 


81 . 


8 


1165 


9 


US-09-323-998D-11 


Sequence 11, Appl 


4 


236.2 


22 . 


0 


1874 


9 


US-09-323-998D-33 


Sequence 33, Appl 


5 


107.2 


10 . 


0 


1031 


9 


US-09-323-998D-35 


Sequence 35, Appl 


6 


94 


8 . 


8 


816 


15 


US-10-128-714-2424 


Sequence 2424, Ap 


7 


90.8 


8 . 


. 5 


591 


15 


US-10-156-761-1652 


Sequence 1652, Ap 


8 


90.8 


8 . 


, 5 


816 


15 


US-10-128-714-7424 


Sequence 7424, Ap 


9 


90.8 


8 . 


. 5 


9025608 


15 


i US-10-156-761-1 


Sequence 1, Appli 


10 


82.8 


7 . 


, 7 


831 


13 


US-10-424-599-13178 


Sequence 13178, A 


11 


82 


7 . 


, 6 


876 


9 


US-09-938-842A-292 


Sequence 292, App 


12 


82 


7 . 


. 6 


876 


11 


US-09-938-842A-292 


Sequence 2 92, App 


13 


82 


7 , 


. 6 


996 


9 


US-09-323-998D-10 


Sequence 10, Appl 


14 


80.8 


7 , 


. 5 


1002 


9 


US-09-323-998D-28 


Sequence 28, Appl 


15 


80.4 


7 . 


, 5 


855 


9 


US-09-938-842A-572 


Sequence 572, App 


16 


80.4 


7 . 


, 5 


855 


11 


US-09-938-842A-572 


Sequence 572, App 


17 


80.4 


7 ( 


. 5 


880 


15 


US-10-342-224-95 


Sequence 95, Appl 


18 


80.4 


7 , 


, 5 


954 


9 


US-09-323-998D-9 


Sequence 9, Appli 


19 


79.4 


7 . 


, 4 


985 


9 


US-09-323-998D-31 


Sequence 31, Appl 


20 


79 


7 , 


, 4 


1271 


9 


US-09-323-998D-29 


Sequence 2 9, Appl 


21 


78.4 


7 . 


. 3 


775 


16 


US-10-369-493-27183 


Sequence 27183, A 


22 


74.4 


6. 


, 9 


684 


10 


US-09-918-740-23 


Sequence 23, Appl 


23 


74.4 


6 . 


, 9 


684 


10 


US-09-918-740-54 


Sequence 54, Appl 


24 


74.4 


6, 


. 9 


8400 


10 


US-09-918-740-64 


Sequence 64, Appl 


25 


74.4 


6 . 


. 9 


14623 


10 


US-09-918-740-74 


Sequence 74, Appl 


26 


74.4 


6. 


. 9 


14623 


10 


US-09-918-740-76 


Sequence 7 6, Appl 


27 


73.6 


6, 


, 9 


958 


15 


US-10-128-714-1424 


Sequence 1424, Ap 


28 


73.6 


6 , 


. 9 


958 


15 


US-10-128-714-6424 


Sequence 6424, Ap 


29 


73.6 


6 . 


. 9 


2957 


15 


US-10-128-714-424 


Sequence 424, App 


30 


73.6 


6. 


, 9 


2958 


15 


US-10-128-714-5424 


Sequence 5 4 24, Ap 


31 


71.4 


6, 


. 6 


988 


9 


US-09-323-998D-32 


Sequence 32, Appl 


32 


70.2 


6, 


.5 


954 


9 


US-09-323-998D-34 


Sequence 34, Appl 


33 


70.2 


6, 


.5 


960 


9 


US-09-323-998D-13 


Sequence 13, Appl 


34 


70 


6, 


.5 


846 


16 


US-10-320-797-2065 


Sequence 2065, Ap 


35 


70 


6, 


.5 


1639 


17 


US-10-389-566-224 


Sequence 224, App 


36 


68.2 


6. 


. 4 


827 


16 


US-10-369-493-36552 


Sequence 36552, A 


37 


67 


6, 


.2 


1332 


16 


US-10-120-988-353 


Sequence 353, App 


38 


67 


6, 


.2 


1359 


16 


US-10-094-749-1226 


Sequence 1226, Ap 


39 


67 


6, 


.2 


1378 


15 


US-10-103-313-24 


Sequence 24, Appl 


40 


67 


6, 


.2 


1453 


15 


US-10-103-313-152 


Sequence 152, App 


41 


62.8 


5, 


.8 


409 


10 


US-09-918-995-17650 


Sequence 17650, A 


42 


60.4 


5, 


.6 


807 


16 


US-10-369-493-26033 


Sequence 26033, A 


43 


60.4 


5 


. 6 


807 


16 


US-10-369-493-26038 


Sequence 26038, A 


44 


59.4 


5, 


.5 


788 


13 


US-10-425-114-1629 


Sequence 1629, Ap 


45 


59.4 


5, 


.5 


820 


13 


US-10-425-114-16380 


Sequence 16380, A 



ALIGNMENTS 



RESULT 1 

US-09-323-998D-30 



